ICA-based feature extraction for phoneme recognition

Authors

  • Oh-Wook Kwon
  • Te-Won Lee
Abstract

We propose a new scheme to reduce phase sensitivity in independent component analysis (ICA)-based feature extraction using an analytical description of the ICA-adapted basis functions. Furthermore, since the basis functions are not shift invariant, we extend the method to include a spectral-domain ICA stage that removes redundant time-shift information. The performance of the new scheme is evaluated for TIMIT phoneme recognition and compared with the standard mel frequency cepstral coefficient (MFCC) feature.
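The abstract's motivation for a spectral-domain stage can be illustrated with a general signal-processing fact: the magnitude spectrum of a frame does not change when the frame is shifted in time, whereas the raw samples do. The sketch below is not the authors' method; it only demonstrates that property on a synthetic frame. The helper names `dft_magnitude` and `circular_shift`, the frame length, and the use of a circular shift (rather than a sliding analysis window over real speech) are simplifying assumptions made for illustration.

```python
import cmath
import math


def dft_magnitude(frame):
    """Return the magnitude of the discrete Fourier transform of a real frame."""
    n = len(frame)
    mags = []
    for k in range(n):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mags.append(abs(acc))
    return mags


def circular_shift(frame, shift):
    """Rotate the frame to the right by `shift` samples (assumed circular)."""
    shift %= len(frame)
    return frame[-shift:] + frame[:-shift]


# Synthetic frame with two sinusoidal components (hypothetical test signal).
N = 32
frame = [
    math.sin(2 * math.pi * 3 * t / N) + 0.5 * math.cos(2 * math.pi * 5 * t / N)
    for t in range(N)
]
shifted = circular_shift(frame, 7)

# Largest sample-wise difference in the time domain: clearly nonzero.
max_time_diff = max(abs(a - b) for a, b in zip(frame, shifted))

# Largest difference between magnitude spectra: zero up to rounding error.
max_spectral_diff = max(
    abs(a - b) for a, b in zip(dft_magnitude(frame), dft_magnitude(shifted))
)

print("time-domain difference:", max_time_diff)
print("magnitude-spectrum difference:", max_spectral_diff)
```

A time shift only multiplies each DFT coefficient by a unit-magnitude phase factor, so features built on magnitudes discard the shift while features built on raw samples or phases do not.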

Similar resources

Phoneme recognition using ICA-based feature extraction and transformation

We investigate the use of independent component analysis (ICA) for speech feature extraction in speech recognition systems. Although initial research suggested that learning basis functions by ICA for encoding the speech signal in an efficient manner improved recognition accuracy, we observe that this may be true for recognition tasks with little training data. However, when compared in a large...

Spectral Analysis of Speech: A New Technique

ICA, which is generally used for blind source separation, has been tested for feature extraction in speech recognition systems as a replacement for the phoneme-based approach of MFCC. Applying the generated cepstral coefficients to ICA as preprocessing yields a new signal processing approach. This gives much better results than MFCC and ICA used separately, for both word and speaker recognition...

Integrated Phoneme Subspace Method for Speech Feature Extraction

Speech feature extraction has been a key focus in robust speech recognition research. In this work, we discuss data-driven linear feature transformations applied to feature vectors in the logarithmic mel-frequency filter bank domain. Transformations are based on principal component analysis (PCA), independent component analysis (ICA), and linear discriminant analysis (LDA). Furthermore, this pa...

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research shows that using deep neural networks (DNNs) in speech recognition systems significantly improves their performance. DNN-based phoneme recognition systems have two phases: training and testing. Mos...

Speaker Independent Isolated Tamil Words for Speech Recognition using MFCC, IPS and HMM

The process of converting an acoustic waveform into text that represents the information conveyed by the speaker is termed speech recognition. Nowadays, a Hidden Markov Model (HMM) based speech recognizer with Mel Frequency Cepstral Coefficient (MFCC) feature extraction is normally used. The proposed speech feature vector is generated by projecting an observed vector onto an Integrated Phoneme...

Publication date: 2004